proposing a method to classify texts using data mining

نویسندگان

mohammad rostami

seyed saeed ayat

iman attarzadeh

farid saghari

چکیده

today a significant part of available data is saved in text database or text documents. the most important thing is to organize these documents. one way to organize text documents is to classify them. to classify texts is to assign text documents to their actual categories. this has two main steps, i.e. feature- and learning algorithm selection. there have been several methods suggested to classify text documents. in this paper, we propose a combined method to do this more efficiently. when selecting features, the proposed method uses filtering in order to reduce complexity and it is implemented using naïve bayes and decision tree categories. results indicate advantages of this combined method to individual classifying.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Proposing an Efficient Method to Classify MRI Images Based on Data Mining Techniques

Nowadays, Magnetic Resonance Images (MRI) is the most common tool for diagnosis of soft tissues. Using fully automated classification magnetic resonance images of the human brain that are important for clinical research studies, can be detect the healthy or sick person. This paper purpose enhances the classification accuracy and achieves good performance in classifying the MRI images. Automatic...

متن کامل

Learning to Classify Texts Using Positive and Unlabeled Data

In traditional text classification, a classifier is built using labeled training documents of every class. This paper studies a different problem. Given a set P of documents of a particular class (called positive class) and a set U of unlabeled documents that contains documents from class P and also other types of documents (called negative class documents), we want to build a classifier to cla...

متن کامل

Proposing an approach to calculate headway intervals to improve bus fleet scheduling using a data mining algorithm

The growth of AVL (Automatic Vehicle Location) systems leads to huge amount of data about different parts of bus fleet (buses, stations, passenger, etc.) which is very useful to improve bus fleet efficiency. In addition, by processing fleet and passengers’ historical data it is possible to detect passenger’s behavioral patterns in different parts of the day and to use it in order to improve fle...

متن کامل

Data Mining Algorithms to Classify Students

In this paper we compare different data mining methods and techniques for classifying students based on their Moodle usage data and the final marks obtained in their respective courses. We have developed a specific mining tool for making the configuration and execution of data mining techniques easier for instructors. We have used real data from seven Moodle courses with Cordoba University stud...

متن کامل

Diagnosis of diabetes by using a data mining method based on native data

Background & Aim: Detecting the abnormal performance of diabetes and subsequently getting proper treatment can reduce the mortality associated with the disease. Also, timely diagnosis will result in irreversible complications for the patient. The aim of this study was to determine the status of diabetes mellitus using data mining techniques. Methods: This is an analytical study and its databas...

متن کامل

FUZZY K-NEAREST NEIGHBOR METHOD TO CLASSIFY DATA IN A CLOSED AREA

Clustering of objects is an important area of research and application in variety of fields. In this paper we present a good technique for data clustering and application of this Technique for data clustering in a closed area. We compare this method with K-nearest neighbor and K-means.  

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید


عنوان ژورنال:
journal of advances in computer research

ناشر: sari branch, islamic azad university

ISSN 2345-606X

دوره 6

شماره 4 2015

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023